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ABSTRACT 


The  purpose  of  this  work  is  to  formulate  and  analyze  the  quad- 
ratic cost  problem  associated  with  a bilinear  system.  The  existence 
of  optimal  controls  to  this  problem  is  established.  A class  of 
commutative  bilinear  systems  and  the  minimum  energy  problem  associa- 
ted with  it  are  investigated.  It  is  shown  that  optimal  controls 
for  this  problem  are  in  constant  form  which  are  determined  by  the 
boundary  conditions.  Sufficient  conditions  for  uniqueness  of  opti- 
mal controls  are  also  derived  for  this  problem.  This  system  is 
shown  to  be  reachable  to  a given  state  x1  by  a constant  input  if 
and  only  if  it  is  reachable  to  x^  by  a time-varying  input.  This 
gives  the  control  engineer  a great  deal  of  flexibility  in  the  design 
of  a feasible  easily- implemented  controller.  Application  to  a two- 
dimensional  pursuit-evasion  problem  is  considered  in  which  the  mini- 
mum energy  problem  with  terminal  constraint  is  solved  analytically. 
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I.  Introduction 

The  linear  quadratic  regulator  problem  has  been  studied  in- 
tensively by  modern  control  engineers  and  many  fruitful  results 
have  been  obtained  due  to  its  simple  structure.  The  existence 
and  uniqueness  of  an  optimal  control  to  this  linear  regulator 
problem  is  well  known , [1 , 2].  The  optimal  control  of  nonlinear 
systems  was  studied  by  Lee  and  Markus  [2]  and  Athens  and  Fa lb  [3] 
in  which  the  existence  and  uniqueness  of  an  optimal  control  to 
the  quadratic  cost  problem  were  obtained  for  some  nonlinear  systems 
with  a control  constraint.  Their  results  have  applications  to  cer- 
tain classes  of  problems  including  the  time  optimal  control  of  a 
class  of  bilinear  systems  with  control  constrained  to  a compact  set 
as  reported  in  [4]  and  [5].  But,  in  general,  the  conditions  are 
not  known  for  the  existence  and  uniqueness  of  an  optimal  control  to 
the  quadratic  cost  problem  without  control  constraints  for  bilinear 
systems . 

Bilinear  systems  have  received  attention  in  recent  years  due 
in  part  to  their  applications  in  various  engineering,  biological, 
as  well  as  socio-economics  systems,  Mohler  [6],  and  in  part  to 
their  intrinsically  nearly  linear  structure.  In  fact,  the  bilinear 
systems  not  only  can  represent  those  physical  systems  containing 
a bilinear  mode,  but  also  have  potential  applications  in  the  modeling 
of  systems  containing  sinusoidal  nonlinearities,  especially  those 
arising  in  spatial  flight  systems.  This  observation  motivated  the 


study  of  the  bilinear  regulator  problem  discussed  in  this  paper. 

The  existence  of  optimal  controls  to  this  problem  is  established 
in  Section  II.  The  approach  used  here  is  similar  to  that  of  Lee  [2] 


for  linear  systems  but  extends  it  in  conjunction  with  the  concept 
of  attainable  set  for  bilinear  systems. 

A class  of  bilinear  systems  in  which  the  system  matrices  commute 
with  each  other  was  studied  by  Sussmann  [7].  It  was  shown  that 
the  attainable  set  is  closed  relative  to  "bang-bang”  input  functions 
over  a given  interval  [0 ,T] . Here,  the  minimum  energy  control  of  such 
cummutative  bilinear  systems  is  investigated  in  Section  III.  Suffi- 
cient conditions  for  the  uniqueness  of  solutions  to  the  two-point 
boundary  value  problem  stemming  from  the  Pontryagin  Maximum  Principle 
will  be  derived.  This  result  assures  that  solving  a nonlinear  al- 
gebraic equation  will  lead  to  the  optimal  solution  to  this  problem. 

That  is,  the  optimization  problem  is  reduced  from  an  infinite  dimen- 

/ 

sional  to  a finite  dimensional  problem.  Furthermore,  it  is  shown 
that  the  optimal  controls  for  the  minimum  energy  problem  associated 
with  the  commutative  bilinear  system  appear  in  a very  simple  form, 
that  is,  a constant  vector  determined  by  the  boundary  conditions. 

The  reachability  property  for  this  class  of  bilinear  systems  is  also 
discussed.  Me  will  show  that  for  a commutative  bilinear  system,  a 
terminal  state  is  constant  reachable  if  and  only  if  it  is 

reachable  by  a time-dependent  control.  This  fact  provides  a great 
deal  of  insight  in  implementing  a feasible  controller  for  this  kind 
of  system  from  a design  point  of  view.  It  will  also  be  noted  that 
for  this  class  of  bilinear  systems,  the  reachable  zone  is  much  easier 
to  characterize  because  the  bilinear  system  beoomes  a linear  time 
invariant  parameterized  system  when  a constant  input  uQ  is  imposed 
as  a parameter  vector.  One  need  only  compute  the  corresponding 


transition  matrices  9(t,t0iuc^),  i - l,...m,  which  characterise 
the  reachable  sone  of  the  bilinear  system, independent  of  the 
initial  conditions. 

Application  to  a two-dimensional  pursuit-evasion  problem  is 
considered  in  Section  IV.  This  problem  has  been  studied  for  many 
years  by  using  different  formulations  and  control  schemes.  Among 
these,  Deyst  [8]  applied  the  theory  of  optimal  stochastic  control 
to  derive  a sophisticated  nonlinear  control  which  lead  to  a signi- 
ficant improvement  in  performance  in  relation  to  those  resulting 
from  the  conventional  proportional  navigation  guidance  law.  Mean- 
while, the  theory  of  differential  games  was  considered  by  Ho  [9] 
and  Raj an  [10]  to  determine  optimal  strategies  for  this  problem. 
Slator  [11]  took  the  perturbation  point  of  view  in  applying  optimal 
control  theory  to  this  problem.  A direct  approach  using  a bilinear 
regulator  formulation  is  considered  here  which  takes  advantage  of 
the  merits  of  a commutative  bilinear  system  as  mentioned  in  Section 
III  to  analyse  the  nonlinear  problem  directly.  A closed  form  solu- 
tion to  this  problem  is  obtained.  It  has  a great  deal  of  potential 
to  be  implemented  in  practice  due  to  the  structural  constancy  of 
the  controller. 


II.  Bilinear 


ilator  Problem 


Consider  the  single-input  bilinear  system1 


[A+Bu(t)]x,  x(tQ) 
t e [t0,T] 


*0  e R 


Notice  that  a direct  input  term  Cu  can  always  be  absorbed  into 
the  bilinear  term  Bux  by  introducing  an  additional  state,  x_. , - 1, 
to  obtain  this  exoression.  n A 


r 

I 


1 


4 


where  A and  B are  n x n constant  matrices;  u(t)  is  a square 
integrable  function  defined  on  [tQ,T] ; xQ  i8  the  initial  state. 

We  shall  first  study  some  basic  properties  of  the  single-input 
bilinear  system  associated  with  a minimum  energy  coat,  then  general- 
ize this  to  the  multi-input  case  together  with  the  general  quadratic 
cost. 

The  problem  is  to  find  an  input  function  u*  e L2 ( [ t q ,T] ,R) , 
the  class  of  square  integrable  functions  defined  on  [tQ,Tj,  which 
minimises  the  following  cost  function 

J(u)  * x'  (T)Qx(T)  + 


tT 


(t)dt 


(2) 


where  Q is  a n x n nonnegative  definite  symmetric  constant 
matrix  and  prime  denotes  the  matrix  transpose  operation. 

Define  the  set  of  attainability  K of  (1)  and  (2)  by 


K-  { (x° (u) fx(T;u) ) e Rn+1sx°(u)  = u2 (t)dt,x(Tju)  = • (T,t0;u)x(), 


u e L2Ut0,T],R)> 


(3) 


where  #(t,tQ;u)  is  the  state  transition  matrix  of  (1)  for  each 
given  u.  It  is  clear  that  K consists  of  the  pairs  of  cost  and  - 
terminal  state  corresponding  to  all  admissible  input  functions  as 
coordinates . 


Now  consider  the  map  GtK  -*■  [0,<»)  defined  by 


_ . 


G(x  (u)  ,x(T;u) ) = x'  (T;u)Qx(T;u)  + x°(u) 


Every  subset  N(p)  - { (xu  (u)  ,x(T;u) ) e K:  0 < x°(u)  < p,  p < 
of  K is  compact. 


The  proof  is  straightforward  by  the  Euclidean  topology  and  the 
linear  structure  of  (1)  for  each  given  u.  Since  N(p)  C K C Rn+* 

and  0 < x°(u)  < p,  one  can  easily  establish  the  closedness  and 
boundedness  of  the  set 

Eq  *{  x(T;u)  su  e L2  ( [t( 
the  lemma. 


u (t)dt  _<  p},  which  proves 


Therefore,  for  any  nested  sequence  N(n)  C N (n+l)C* • • , 

00 

K ■ U N (n) . The  compactness  of  N(p)  for  each  given  positive  p 
assures  the  existence  of  an  optimal  solution  to  the  system  (1)  and  (2) 
as  given  in  the  next  theorem. 


Theorem  1:  (Existence) 

Given  the  system  (1) , there  exists  an  optimal  input  function 
2 

u*  e L ([tQ,T],R)  which  minimises  the  cost  function  (2). 


By  definition  of  G, 

G(x°(u) , x (T;  u) ) > 0 


< 00 , such  that 


We  can  always  choose  a proper'  M such  that 

0 <_  igf  G(x^  (u)  ,x(T;u) ) < M.  Therefore,  there  always  exists 

(x° (Uj) ,x (Tju^) ) e K such  that  G (x° (u^) fXtTju^) ) < M and 

x°  (u^  <,  p0*(ui(t)  ■ 0 is  an  example).  Hence,  the  set  N(pQ) 

defined  by  N(pQ)  * { (x° (u) ,x(T;u) ) e K:  0 < x°(u)  < pQ, 

2 

u e L ( Itg,T] ,R) } is  nonempty.  Furthermore,  by  Lemma  1,  N(pQ)  is 
compact  in  Rn+^.  The  continuity  of  G on  N (Pq)  can  be  easily 
established  which  assures  that  G attains  its  minimum  in  N(pQ). 

This  will  imply  the  existence  of  an  optimal  input  u*  which  minimises 
the  cost  function  (2)  because  outside  N (p^ ) the  cost  is  always 
greater  than  M as  shown  in  (6).  o.E.D, 


For  instance,  choose  M>  l*A(T,t0)xQl 'Q*A(T,t0)x0 
is  the  transition  matrix  associated  with  A j then 
i£f  G(x° (u)  , x(T;u) ) < I*A(T,t0)x0]  'Q*A(T,t0)x0  < N 
intermediate  term  is  the  cost  when  u(t)  - 0. 


where  ♦ 


7. 


In  general,  for  the  multi-input  bilinear  system 


e 

X * (A  + ][  B . U • ) X 

i=l  1 1 


u (t ) » (u1#  •••»%)  ' 


where  A,  B^ (i*l, ... ,m)  are  n x n constant  matrices; 
u e L2 ( ftp, T] ,Rm) , the  class  of  m-vector  valued  square  integrable 
functions  on  ItQ,T],  and  the  associated  quadratic  cost 

fT 

J2(u)  - x'  (T)Qx(T)  + I [x‘  (t)W(t)x(t)  + u'  (t)R(t)u(t)  ]dt  (8) 

^ A 


where  Q is  as  defined  in  (2);  W(t)  is  a n x n symmetric  matrix- 
valued function  continuous  in  t and  nonnegative  definite  for  each 
t £ [t0,T] ; R(t)  is  a m x m symmetric  matrix-valued  function  con- 
tinuous in  t and  positive  definite  for  each  t e [tQ,T],  a similar 
existence  result  can  be  derived.  We  state  this  in  the  next  theorem. 


Theorem  2: 

There  exists  an  optimal  input  function  u*  e L2 ( [tQ,Tl ,Rm)  which 
minimizes  the  cost  given  by  (8)  subject  to  the  constraint  (7). 


Proof: 


8. 

we  can  construct  the  set  N(pQ)  similar  to  that  in  Theorem  1.  The 
closedness  of  N(Pq)  follows  from  the  same  construction  of  a con- 
vergent subsequence  u^  (weakly)  and  x(t,um).  The  boundedness  of 
N(Pq)  and  continuity  of  G on  N(pQ)  cam  be  shown  by  a similar 
argument  as  in  Theorem  1. 

The  existence  results  derived  here  allow  us  to  apply  the  Maximum 
Principle  in  obtaining  a two-point  boundary  value  problem  and  inves- 
tigate its  solution  either  by  analytic  methods  or  numerical  schemes. 
The  minimum  energy  control  of  a class  of  bilinear  systems  in  which 
the  system  matrices  A and  B^  commute  with  each  other  will  be 
discussed  in  the  next  section. 


III.  Minimum  Energy  Control  of  Comnutative  Bilinear  Systems 
Definition  1: 

The  system  (7)  is  called  a commutative  bilinear  if 

every  pair  of  the  matrices  (Af  B^Bj, . . ,Bm)  commute  with  each  other. 

The  commutative  bilinear  system  has  been  studied  by  Sussmann  [7] 
in  which  it  is  shown  that  the  attainable  set  of  this  system  with 
bang-bang  controls  is  closed.  Baras [12]  recently  extended  this  re- 
sult to  the  delayed  commutative  bilinear  system.  The  problem  con- 
sidered here  is  the  minimum  energy  control  of  a commutative  bilinear 
system  without  control  constraint.  It  will  be  shown  that  this  pro- 
blem has  a simple  solution  which  possesses  an  easily- implemented 
character . 


Two  kinds  of  cost  functions  are  investigated  here. 


S'  'MHM'WwywMHuni  Jtw 


9 


rT 

Jx(u)  - X' (T)Qx(T)  + j u'(t)Ru(t)dt 


without  a terminal  constraint  on  the  state,  and 


ft 

J2(u)  = j u'(t)Ru(t)dt  with  x(T)  » xx 


where  x^^  is  a prespecified  vector. 

Obviously,  the  problem  associated  with  the  cost  (10)  involves 
the  attainability  of  the  given  system  to  the  desired  target  set. 


A.  Without  Terminal  Constraint 
Lemma  2 ; (Pontryagin) 

The  optimal  solution,  if  it  exists,  of  the  system  (7)  with  cost 
(9)  satisfies  the  following  two-point  boundary  value  problem 


m * 

x*  - (A+  l B.u. )x* 
i*l  1 x 


X<V  * xo 


ro  . * 

p*  = -(A'+  I B.u. ) p*  p(T)  * -Qx (T) 
i*l  11 


u*(t) 


1 R-1 

J R 


* • l 

x B, 


*•  • 

BnJ 


This  is  a special  case  of  the  Maximum  Principle  applied  to  the 


where  *. (t,trt)  is  the  state  transition  matrix  associated  with  the 


The  existence  of  optimal  controls  follows  from  Theorem  2.  Then 
Lemma  2 implies  that  the  optimal  solutions  satisfy: 


m * 

(A+  l B.u.)x*  , 

i-1  1 1 


X<V  « xQ 


-(A'+  l B. u^)p*  , p(T)  ■ -Qx(T) 

i-1  1 1 


u*(t)  = 


1R-1 


fx*'  B.  p*' 


x*'  b'  p* 
m 


Let  R = 


= (Ylf...fYm)  where  y^  = ^Yii' • • • ' » then  a direct 


substitution  implies 


£ I ^ I 

ui(t)  = il1Yji(lt*'Bjp*) 


i = 1, 2, . . .m 


Differentiate  (15)  with  respect  to  t and  substitute  (14)  into  it. 


3t  ui(t)  = 7 ^ Yji[i*,BjP*  + x*'B'p*l  - 0 


, i — 1,2,  ..  .1 


where  the  commutivity  assumption  is  strongly  used.  Therefore, 


ui(t)  » ui  = constant,  i - 1,2,..., in. 


With  this  constant  input  function,  the  solution  of  (14)  can  be 
written  as 


^ ~~~ 1 1 — — 


x*(t)  - *i^t#t0^x0 


P*  (t) 


♦A(T,t)  n *4(T,t)p(T) 

A i»l  X 

-♦j(T.t)  jf^  ^(T,t)Q*A(T,t0) 


from  which  the  desired  expression  for  u*  can  be  derived.  Q.E.D. 

Theorem  3 states  the  simple  character  of  the  minimum  energy 
control  problem  for  commutative  bilinear  systems,  i.e.  the  optimal 
controls  are  simply  constant  vectors  which  satisfy  the  algebraic 
equation  (13).  This  fact  enables  us  to  treat  the  optimal  commuta- 
tive bilinear  system  as  a fixed  linear  system  for  which  the  explicit 
solution  is  immediately  available.  This  result  also  holds  for  a 
slightly  more  general  system  in  which  A - A(t)  is  a n x n time- 
varying  matrix  which  commutes  with  Bi (i-1, . . . ,m)  because  in  the 
proof  the  time-dependence  of  A does  not  play  any  role. 

In  order  to  obtain  u*  from  the  nonlinear  algebraic  equation 
(13)  by  iterative  schemes,  it  is  interesting  to  study  the  uniqueness 
property  of  the  solution  to  (13) . Next,  we  will  use  the  property 
of  monotonically  increasing  maps  to  show  the  uniqueness  of  solutions 
to  (13)  for  a class  of  commutative  bilinear  systems. 


I 


Definition  2t 


A continuous  map  M from  R into  itself  is  monotonically  increa 
ing  if  xx  > x2  implies  M(xx)  > M(x2)  for  all  xr  and  x2  in  R. 


Given  a monotonically  increasing  map  H,  then  the  solution 
to  the  equation  x + M(x)  - 0 is  unique. 

Proof: 

The  proof  is  straightforward  and  left  for  the  reader  as  an 
exercise. 

Theorem  4: 

There  is  a unique  optimal  solution  to  the  single- input  (m-1) 
commutative  bilinear  system  (7)  with  cost  (9)  if  the  matrix 

t I 

Bl®  + B1®B1  is  nonnegative  definite. 


I 


by  hypothesis  that  B1  Q + B ^qb1  is  nonnegative  definite.  Hence 
M is  monotonically  increasing  and  Lemma  3 gives  the  uniqueness 
of  the  optimal  control  u*  to  (17) . Q.E.D. 


The  situation  turns  out  to  be  much  more  complicated  in  the 
multi-input  case.  One  can  generalise  the  concept  of  a monotonically 
increasing  map  into  the  m-dimensional  Euclidean  space  as  follows. 


Definition  3: 

A continuous  map  6 from  Rm  into  itself  is  monotonically 
increasing  if  <x1-x2,G(x1)-G(x2»  > 0 for  all  xx  and  x2  in  tP, 
where  <•,•>  is  the  usual  inner  product,  i.e.  <x,y>  ■ x'y. 

We  have  a similar  lenma  as  in  the  scalar  case. 

Lenina  4: 

Given  a monotonically  increasing  map  G,  then  the  solution  of 
the  equation  x + G(x)  ■ 0,  x e Rm  is  unique. 


Proof: 


Again,  the  proof  is  left  to  the  reader. 


Theorem  5: 


There  is  a unique  optimal  solution  to  the  commutative  bilinear 
system  (7)  with  cost  (9)  if  the  matrix  R_1Z(vQ)  is  nonnegative 
definite  for  all  vn  in  Rn,  where  Z(vn)  * (Z,  .)  and 


K.  . 


til  • 


Zi;J  a Vq  (BjBj^Q+B^QBj  ) Vq  i i»  j = 1/2,.  ..in. 


Proof : 


The  proof  is  similar  to  that  of  Theorem  4 but  with  some  modifi- 
cation. Here,  we  define  G(u*)  = ^ R_1g  where 


9j  - yo^tT'V^  *i(T,t0)BjQini  ♦i(T,t0)*A(T,t0)y0,  j - 1 m 


(u*;h)  - [|  R“1Z(y0)].h  h e Rm 


is  the  Frechet  differential  of  G at  u*  with  increment  h,  and 


t I I 


!ij  - y0,*<T't0»)t;i  VT'VtB3BiQ+BiQBj1  ^ VT'Wr'V»0' 


By  hypothesis,  R Z(Vg)  > 0 which  implies  that  G is  monotonic- 
ally  increasing.^  Therefore,  Lemma  4 gives  the  uniqueness  result. 


Because  R xZ(Vg)  > 0 implies  that  the  Frechet  derivative  of  G 
is  monotonically  increasing,  which  implies  the  monotonicity  of  G, 
see  Vain berg  [14] . 
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B.  With  Terminal  Constraint 

So  far,  we  have  dealt  with  the  minimum  energy  control  problem 
which  includes  a terminal  cost  as  defined  in  (9).  Many  times  the 
minimum  energy  control  problem  is  considered  with  some  terminal 
constraint  instead  of  a quadratic  cost  term,  e.g.,  one  may  be  only 
interested  in  driving  a given  dynamical  system  to  a certain  desired 
target  set  at  some  terminal  time  (either  fixed  or  free)  while  the 
total  control  energy  spent  is  minimized.  In  other  words,  the  mini- 
misation of  control  energy  is  taken  over  the  set  U of  admissible 

c 

controls  which  consists  of  those  input  functions  which  do  steer  the 
given  system  to  the  desired  target  set  at  a certain  finite  time. 

On  the  other  hand,  a bilinear  system  does  not  generally  have  the 
global  controllable  property  as  a linear  system  which  has  been  quite 
thoroughly  characterised.  Therefore,  we  shall  limit  our  attention 
only  to  the  reachable  zone  Z associated  with  each  bilinear  system 
rather  than  the  whole  space  Rn  as  the  target  set  when  we  are  deal- 
ing with  the  cost  (10) . 

Definition  4> 

A set  Z(xQ;U)  is  called  a reachable  set  associated  with  (7)  if 

Z(xQ;U)  - jx(T)eRn:x(t0)  - xQ,  u(t)eU,x(t)  satisfies  (7)  in  some| 

finite  time  interval  [tQ,T] . 

A *et  S (D;U)  is  called  a reachable  zone  associated  with  (7)  if 


Z(D,U)  - U Z(x0;U). 
x0eD 


17. 


Definition  5: 

System  (7)  is  reachable  to  x1  with  respect  to  xQ  and  U 

if  there  is  an  input  u(t)  e U which  steers  it  from  xrt  to  x, 

at  some  finite  time  T.  System  (7)  is  constant  reachable  to  x1 

with  respect  to  x0  if  there  is  a constant  input  function  u 

w c 

which  steers  it  from  xQ  to  x1  at  some  finite  time  T. 

The  following  theorem  states  an  interesting  property  regarding 
the  reachability  of  commutative  bilinear  systems. 

Theorem  6: 

The  commutative  bilinear  system  (7)  is  reachable  to  with 

respect  to  xQ  and  L2 ( [tQ,T] ,Rm)  if  and  only  if  it  is  constant 
reachable  to  x^  with  respect  to  Xq. 

Remark: 

The  Theorem  assures  that  if  u(t)  e L2 ( [t0,T] ,r“)  steers  the 
commutative  bilinear  system  from  xQ  to  x1  at  T,  then  there 
exists  a constant  input  function  uc  which  can  do  the  same  job 
as  well.  This  enables  us  to  study  the  commutative  bilinear  system 
with  a class  of  simple  easily  implemented  input  functions , namely, 
the  constant  input  functions. 

Proof: 

The  sufficiency  part  is  true  by  definition.  To  show  the  neces- 
sity part,  suppose  u(t)  e L2 ( [tQ,T] ,Rm)  steers  (7)  from  xQ  to 
x^  at  some  finite  time  T.  If  we  can  construct  a constant  input 
uc  which  also  steers  (7)  from  xQ  to  x1  at  T,  then  we  are  done. 


Since  each  pair  of  (AfB^i-l, . . .m)  commute  with  each  other, 
the  solution  of  (7)  can  be  expressed  as: 


Choose  u„  such  that 


then  obviously 


Again,  this  result  holds  for  a slightly  more  general  bilinear 

system  in  which  A * A(t)  is  time-varying  provided  (A(t)  ,Bi;i»l, . . 

commute  with  one  another.  For  this  class  of  bilinear  systems,  the 

reachable  sone  is  much  easier  to  characterise  because  one  need  only 

consider  a constant  input  u_  as  a set  of  parameters,  then  compute 

the  corresponding  transition  matrices  * (t,tn) , i * l,...m,  which 

ci  0 

characterise  the  reachable  sone  of  the  given  system  (7)  with  a set 
D of  initial  conditions. 


Bearing  this  concept  of  reachable  sone  in  mind,  we  may  consider 
the  minimum  energy  control  problem  (10)  of  the  commutative  bilinear 
system  (7)  associated  with  a terminal  constraint  x(T)  ■ x. , some 


»;  " 
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a priori  specified  final  state.  The  existence  of  a constant  optimal 
control  is  derived  via  a different  approach  than  that  used  in 
Theorem  3.  The  constant  optimal  control  is  determined  through  the 
boundary  conditions.  Then  one  example  is  given  to  show  the  non- 
uniqueness of  optimal  control  in  general. 

Theorem  7: 

Given  a commutative  bilinear  system  (7)  with  the  cost  (10).  If 

x1  belongs  to  the  reachable  set  Z (xQ»L2 ( [tQ,T] ,Rm) ) , then  there 

* 

exists  a constant  optimal  control  u which  steers  (7)  from  xn 

C 0 

to  x1  at  T and  minimizes  the  associated  cost  (10) . Furthermore, 
* 

uc  satisfies  the  nonlinear  algebraic  equation: 

X1  ■ ♦ci(T'to,xo-  (20) 

Proof : 

2 m 

Suppose  U C L ( [tQ ,T] ,R)  is  the  set  of  admissible  controls, 
i.e.,  0 - (u(t)eL2([t0,T],Rm) :*A(T,t0)  U (T, tQ)  - x J + 0, 


by  hypothesis.  By  Theorem  6,  there  exists  a nonempty  subset  E con- 
sisting of  all  elements  of  U which  are  constant  input  functions. 

For  each  u(t)  e U,  there  is  a u_  e E such  that 

c 


■■ 


r.  ^ 1 * If  • • «iQf  then 


£ »>C  dt  ■ J ri»Ci  2(T-t0>  * °1< 


r ■ 

Jt" 


(t)Ru(t)dt 


m fT 

I rif 

i“l  jtA  i 


(t)dt  - U2. 


In  order  to  show  that  Ux  < U2,  it  suffices  to  show  that 


fT 

uc  2(T”t0)  - j ui2<t>dt  » i » 1. 
^ tA 


But  this  is  true  because 


“ct2«-V2 


fT  fT! 

(J  lucjdt)2  - (J  l«i<t)  |dt)2  < j ui2(t)dt(T-t0) 


i ■ l,...m,  where  the  last  part  is  due  to  Hdlder's  inequality. 
Moreover,  the  equality  is  achieved  if  and  only  if  u^t)  - c,  a 
constant.  Therefore,  if  the  minimum  energy  exists,  it  must  be  in- 
curred by  a constant  input  u*  e E.  But  this  is  indeed  the  case 

v 

because  the  cost  is  continuous  in  uc,  it  attains  its  minimusi  in 


son  compact  subset  of  E. 

Since  optimal  controls  are  in  constant  fonts,  they  satisfy  (20) 
by  a direct  computation  of  the  solution  of  the  'linear'  fixed  system 


Q.E.D. 
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Finally,  we  consider  a second  order  single-input  commutative 
bilinear  system  as  an  example  to  show  the  non-uniqueries s of  the 
optimal  control. 


Given  a bilinear  system 


1 0 


u,  x (0) 


,x(l) 


0 1 


-1  0, 


with  the  cost  to  be  minimized: 


J(u)  - f u 
J0 


2 (t)dt  , u (t ) e L2(t0,l] ,R) . 


It  can  be  easily  seen  that  (A^B^)  commute;  hence,  by  Theorem  7 

* 

an  optimal  control  u satisfies  the  algebraic  equation: 

C 


[cos  u*  sin  u*l 


[e1sin  u*l 


l-sin  u*  cos  u*j 
v c cJ 


[e1cos  u*j 
*■  cJ 


* * 

Solving  for  u , we  obtain  u * kir,  k - ±1,±3...  In  order  to 
c c 

have  the  total  control  energy  minimized,  the  integers  with  the 

smallest  absolute  value  are  chosen,  i.e.  ^ ■ 1,  kj  ■ -L  Clearly, 

* * 2 
u^  * it  and  u2  = -x  both  incur  a minimal  cost  of  x and  steer 


:v,£Vr€? 
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the  given  system  to  the  desired  final  state.  Hence  both  u*  and 
* 

U2  are  optimal  controls. 

The  lack  of  uniqueness  of  the  minimum  energy  control  problem  for 
a commutative  system  associated  with  a terminal  constraint  is, in 
general, expected  because  the  nonlinear  two-point  boundary  value 
problem  generally  does  not  have  a unique  solution  which  prevents  the 
uniqueness  of  optimal  solutions  in  many  cases. 


IV.  Application  to  a Two-Dimensional  Missile  Intercept  System 

It  is  assumed  that  for  a typical  high-speed  pursuing  missile 
and  short  initial  range,  the  maneuvering  of  the  vehicles  can  be  res- 
tricted to  a two-dimensional  plane.  Choose  the  coordinate  system 
fixed  in  the  missile  as  shown  in  Figure  1.  Denote  the  angular  rate 
of  the  missile  and  the  target  with  respect  to  a non-rotating  reference 
coordinates  as  u and  uT,  respectively. 


I 


The  equations  of  motion  are  then  described  by  [111 

*1  = “VT  sin  x3  + x2up 

x2  = vT  cos  x3  - x1up-vp  (21) 

x3  = uT  - up 

where  vT  and  vp  are  the  line  speeds  of  the  target  arid  the  missile 
relative  to  air;  x1  and  x2  are  the  horizontal  and  vertical  dis- 
tance from  the  missile,  and  x3  is  the  relative  angle  between  the 
headings  of  the  missile  and  target  measured  counterclockwise. 

The  system  (21)  can  be  transformed  into  a homogeneous  bilinear 
system  by  introducing  three  auxiliary  states:  x4  « sin  x3,x5  = cos  x. 

and  Xg  = 1.  That  is, 

x = Ax  + Bxu  (22) 


with 


0 

0 

0 

-VT 

0 

0 ' 

’ 0 

1 

0 

0 

0 

o' 

’ X1<V' 

0 

0 

0 

0 

VT 

-v 

P 

-1 

0 

0 

0 

0 

0 

X2(V 

0 

0 

0 

0 

0 

\ 

, B = 

0 

0 

0 

0 

0 

-1 

, x(t0)  * 

x3(v 

0 

0 

0 

0 

UT 

0 

0 

0 

0 

0 

-1 

0 

sinx3(tQ) 

0 

0 

0 

"«T 

0 

0 

0 

0 

0 

1 

0 

0 

cosx3(tQ) 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0. 

1 

in  which  u = u is  defined  as  the  control  variable 


' 
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The  objective  is  to  find  a square  integrable  function  u*(t) 
which  steers  the  missile  to  the  target  at  some  finite  time  T (a 
free  time  formulation)  while  the  total  control  energy  consumed  over 
this  interval  [tg,T]  is  minimized.  The  following  performance 
index  is  thus  considered: 

J (u)  = | u2(t)dt  , T > tg  (24) 

tg 

subject  to 


X1(T)  = x2(T)  = 0 (25) 

Apart  from  the  classical  proportional  navigation  guidance  law, 
a crucial  assumption  is  proposed  relative  to  the  line  speed  v of 
the  missile  which  leads  to  an  explicit  solution  to  the  afore-mentioned 
problem.  That  is,  vp  is  taken  to  be  proportional  to  (an  amplifica- 
tion of)  up  with  the  proportional  parameter  y to  be  determined 
by  the  boundary  conditions,  provided  up  is  non-zero  of  course. 

Hence,  we  assume 


VP  = YV 


(26) 


With  this  condition.  Equation  (23)  becomes 
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solutions  with  the  minimum  energy  problem.  Consideration  should 
be  given  to  two  separate  cases  in  which  uT  is  zero  and  non- zero. 


A.  Non-zero  Angular  Maneuver  of  the  Target 

From  Equation  (28) , denote  the  intercept  angle  by  8#  i.e. 

fT 

x3(T)  * x3(tQ)  - j u(s)ds  + u,(T-t0)  - 8. 


The  terminal  constraint  (25)  becomes 


[xl(t0)+Y]cos{uT(T-t0)  + x3(t0)-8>  + x2(t0)sin{uT(T-t0)+x3(t0)-8}-Y' 
-IXiltQj+YliiniUjtT-tQ)  + x3(t0)-8>  + x2 (t0)cos{uT(T-t0)+x3 (tQ)-B} 


(cos  6 - cos (T-t0)-61 


[sin[ufp(T-t0)-B]  + sin  Bj 


In  other  words,  the  terminal  constraint  problem  of  the  differential 
system  (22)  and  (27)  has  been  reduced  to  solving  a pair  of  nonlinear 
algebraic  equations  (31)  of  transcendental  type  for  an  appropriate 
set  (y,8,T).  A solution  often  exists  for  this  case  in  which  the 
number  of  unknowns  exceeds  the  number  of  equations. 

The  next  proposition  shows  the  existence  of  a triple  (y,B,T) 
which  solves  (31)  for  every  initial  condition  (xx (tQ) ,x2 (tQ) ,x3 (tQ) ) 
in  R3,  the  analytic  expression  for  this  triple,  as  well  as  the  proof 
of  the  proposition,  is  given  in  Wei  [15]. 


• ■'7’*  -'•’i  J " w * I* 


Proposition  1: 

If  a constant  but  non-zero  angular  maneuver  of  the  target  is 
assumed,  then  there  exists  a triple  (y,0,T)  satisfying  (30)  and 
(31)  which  solves  the  terminal  constraint  problem  (25)  for  every 
(*i (tQ) ,x2 (t0) ,x3(t0) ) e R3.  The  corresponding  proper  control 
action  satisfies  (30) . 


B.  Zero  Angular  Maneuver  of  the  Target  (u^O) 

In  a similar  manner,  the  terminal  constraint  (27)  becomes 


Ixl(t0)+YlcosIx3tt0)"el  + x2<to)sinIx3^to)"63  ~Y  “vT(T“tQ)sin  0 

(32) 

”Ix1(t0)+Ylsin[x3(t0)-Bl  + x2 (tQ)cos [x3 (tQ)-0l  + vT(T-t0)cos  0 

where  0 is  as  defined  in  (30)  but  with  * 0.  Again  a proper 
triple  (y » B,T)  needs  to  be  solved  from  this  equation.  The  following 
proposition  indicates  the  region  in  which  every  initial  condition 
will  introduce  a feasible  solution  for  the  triple.  The  proof  is 
given  in  [15]. 

Proposition  2: 

If  a zero  angular  maneuver  of  the  target  is  assumed,  then  there 
exists  a triple  (y,£,T)  which  solves  the  terminal  constraint  problem 
(25)  for  every  (xx (tQ) ,x2 (tQ) ,x3 (tQ) ) e R3  \ E,  where 

E » {(0,y,z)  e R3s  either  y > 0,  z ■ (2k+l)ir,  or  y < 0,  z ■ 2kir; 

k - 0,±1, ...}  . 


It  can  be  easily  shown  that  the  system  (22)  and  (27)  is  a com- 
mutative bilinear  system.  Thus,  the  results  in  section  III  are 
applicable  to  the  minimum  energy  problem  (24)  and  (25) . The  set 
Uc  of  admissible  controls  in  this  case  includes  those  input  functions 
u (t)  satisfying  the  algebraic  equations  (30)  and  (31)  or  (32).  The 
next  proposition  which  gives  the  explicit  form  of  the  optimal  con- 
trols to  the  problem  (24)  and  (25)  is  a direct  consequence  of  Theorem 
7. 


Proposition  3: 

Given  the  system  (22)  and  (27) , there  exists  an  optimal  control 
u*  e Uc  which  minimizes  the  cost  (24)  and  steers  the  system  to 
xx(T)  * x2(T)  = 0 at  some  T > t0  for  each  appropriate  set  of  initial 
conditions  (x^tg),  x2  (tQ)  ,x3  (tQ)  ,u<r,vT) . This  control  is  given  by 


u*  (t) 


*T 


x3(t0)_e 


(33) 


where  T and  3 are  given  as  discussed  in  Propositions  1 and  2. 


Proof:  Theorem  7 implies  the  existence  of  constant  optimal  controls 

u*  e UQ.  By  Equation  (30),  u*  is  given  as  in  (33). 

The  striking  character  of  this  optimal  control  law  in  a constant 
form  is  not  completely  without  expectation  because  the  control  aspects 
proposed  here  include  two  channels.  One  is  the  angular  maneuver  of 
the  missile  which  counterbalances  (offsets)  the  angular  maneuver  of 
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the  target,  while  the  additional  degree  of  freedom  introduced  by 
Y carries  out  the  major  pursuit  part  of  the  problem  and  leads  to 
a simple  solution  having  some  intuitive  sense. 

On  the  other  hand,  it  should  be  noted  that  although  an  optimal 
control  is  in  a constant  form  (a  step  function),  a sub-optimal  con- 
trol law  can  always  be  constructed  which  will  drive  the  missile  to 
the  target  at  some  T,  with  an  appropriate  intercept  angle  3 and 
for  some  ratio  y as  long  as  the  area  swept  out  satisfies  (30). 
This  allows  the  control  engineer  a great  deal  of  flexibility  in  the 
design  of  a feasible  easily  implemented  sub-optimal  controller. 


V.  Conclusion 

This  paper  deals  with  the  bilinear  regulator  problem  from  a 
theoretical  point  of  view.  The  existence  of  optimal  controls  for 
the  quadratic  cost  problem  associated  with  a bilinear  system  is  es- 
tablished. A class  of  commutative  bilinear  systems  and  the  minimum 
energy  problem  associated  with  it  are  specially  investigated.  This 
problem  has  some  interesting  features  in  which  the  optimal  controls 
are  in  the  form  of  a constant  vector  determined  by  the  boundary  con- 
ditioni.  On  the  other  hand,  as  far  as  the  terminal  constraint  pro- 
blem is  concerned,  the  system  has  a variety  of  different  controls 
to  be  applied  in  reaching  the  desired  terminal  state  provided  these 
controls  satisfy  an  area  condition.  This  has  significant  meaning 
from  the  design  point  of  view.  Sufficient  conditions  for  the  unique* 
ness  of  the  minimum  energy  control  are  derived  for  this  problem  with* 
ou^  * terminal  constraint.  Application  to  a two-dimensional  pursuit* 
evasion  problem  is  considered  in  which  the  analytical  expression  of 
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the  optimal  control  is  obtained.  This  illustrates  the  simplicity 
in  structure  of  the  controllability  and  minimum  energy  problem  for 
commutative  bilinear  systems. 

The  problems  yet  to  be  solved  include  the  construction  of  a 
sub-optimal  closed-loop  control  law,  the  estimation  of  desired 
states  either  in  a deterministic  or  a stochastic  sense,  and  the 
structure  of  the  optimal  controls  for  different  bilinear  systems 
with  different  performance  functions.  Another  interesting  problem 
which  is  currently  investigated  is  the  singularly  perturbed  com- 
mutative bilinear  systems.  An  example  is  given  in  (15]  which  indi- 
cates the  possiblity  to  solve  the  associated  optimal  control  problem 
analytically.  In  addition  to  this,  the  stability  of  the  optimal  bi- 
linear system  should  also  be  examined  because  the  constant  optimal 
control  enters  the  system  directly  in  a feedback  form  with  an  adjust- 
able gain  matrix. 
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